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Abstract 


>y 

Theory  and  computation  techniques  of  the  various 
types  of  generalized  inverses  of  matrices  which  have 
polynomial  elements  x,  y,  z...,  etc.,  are  presented. 

A  simple  algorithm  for  computation  of  generalized  in¬ 
verses  of  a  constant  matrix  is  established,  and  then 
applied  to  the  case  of  matrices  having  polynomial 
elements  in  several  variables.  Reduction  of  a  matrix 
to  its  Smith  form  over  the  ring  of  polynomial  elements 
in  several  variables  is  presented.  A  simple  algorithm 
for  investigation  of  the  system  Ax  =  b  in  case  of 
constant  and  nonconstant  rank  of  A  is  presented. 
Application  of  generalized  inverses  to  solve  more 
general  matrix  equations  such  as  Lyapunov  and  Riccati 
equations  is  studied. 

r- 


V 


Introduction 


The  problem  of  solving  m  linear  equations  in  n 
unknowns  over  the  field  of  complex  numbers  can 
be  formulated  as  a  matrix  equation  Ax  =  b  ,  where  x 

is  the  column  vector  of  'unknowns'.  If  A  is  a  non¬ 
singular  square  matrix,  then  the  linear  matrix  equation 
has  an  immediate  solution,  given  by  x  =  A^b.  If, 
however,  A  is  a  singular  square  matrix  or  in  general 
nonsquare  matrix,  then  the  classical  inverse  of  A  is  not 
defined.  In  this  case  it  is  possible  to  find  a  similar 
representation  of  the  solutions  of  the  system  Ax  =  b 
using  generalized  inverses  associated  with  the  matrix. 

In  1920,  Moore  22  published  a  theory  for  the 
generalized  inverses  in  abstract  form.  In  1955, 

Penrose  published  a  theory  for  a  generalized  inverse 
of  any  matrix  with  complex  elements.  He  showed  that 
for  any  matrix  A  whose  elements  are  complex  numbers 
there  exists  a  unique  generalized  inverse,  called 
Moore-Penrose  inverse,  A+  .  This  unique  matrix  A+ 
is  used  to  find  the  minimum-norm  solution  to  the  least- 
squares  problem  | |Ax-b| |  =  minimum.  Penrose  put  four 
conditions  which  must  be  satisfied  by  the  generalized 
inverse  A*. 

Since  1955,  the  concept  of  generalized  inverse 
has  been  modified  to  include  more  general  generalized 
inverses  which  satisfy  only  some  of  Penroses's  four 
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conditions.  Those  generalized  inverses  are  not  unique. 

The  most  important  generalized  inverse,  which  is  enough 
to  investigate  the  system  Ax  =  b,  is  an  A^  inverse  which 
satisfies  the  first  Penrose  condition,  AA^A  =  A. 

Chapter  I  of  this  thesis  contains  the  theory  of 
all  types  of  generalized  inverses.  The  theory  of 
generalized  inverse  which  produces  a  special  solution 
for  the  system  Ax  =  b,  will  be  studied.  In  Chapter  II, 
the  computation  techniques  of  the  generalized  inverses 
will  be  established.  A  simple  algorithm  for  computing 
all  types  of  generalized  inverses  will  be  introduced. 

In  Chapter  III,  the  theory  of  generalized  inverses  of 
matrices  with  polynomial  elements  will  be  discussed. 

The  conditions  under  which  a  matrix  with  polynomial 
elements  has  a  generalized  inverse  will  be  investigated. 
The  problem  of  finding  a  solution  of  the  system  Ax=b, 
where  A  has  polynomial  elements  in  more  than  one 
variable,  will  be  studied.  Chapter  IV  will  contain 
application  of  generalized  inverse  for  solving  more 
general  matrix  equations  such  as  Lyapunov  and  Riccati 
equation. 

Throughout  this  thesis  capital  letters  denote 
matrices,  while  lower  case  letters  represent  scalars. 
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I  Theory  of  Generalized  Inverses  of  a  Matrix 


Definition  of  Generalized  Inverses 

The  main  purpose  of  this  chapter  is  to  establish 
the  main  concepts  of  generalized  inverses  of  a  matrix 
with  complex  elements.  Any  m»n  matrix  A  having  real 
or  complex  elements  satisfies  the  following  four  axioms 

(I)  AXA  =  A 

(II)  XAX  =  X 

(III)  (AX)  is  hermitian 

(IV)  XA  is  hermitian 

where  X  is  a  unique  (n*m)  matrix.  This  unusual  fact 
was  proved  Moore  £  22  j  in  1920  in  abstract  form  and 
established  as  shown  above  by  R.  Penrose^  25  ]  in  1955. 

The  introduction  to  this  field  of  interest  will  be 
given  in  special  cases  with  respect  to  the  matrices 
which  satisfy  axiom  I;  axioms  I,  II;  axioms  I,  II,  III, 
and  finally,  axioms  I,  II,  IV.  These  more  general 

cases  will  be  denoted  by  A^,  A.^2'  Ai,2,3r  Al,2,4' 
respectively,  which  in  general,  are  not  unique  for 
given  matrix  A. 

Below,  is  given  a  list  of  those  types  of  generali¬ 
zed  inverses.  Unfortunately,  there  are  a  lot  of 

/ 

different  notations  for  each  type,  but  in  this  thesis 
we  will  use  the  notation  given  above. 


A^-Inverses:  every  matrix  possesses  at  least  one 

inverse,  sometimes  termed  "generalized_inverse" ,  A~ 
or  g-inverse. 

A1  2  3  4“Inverses  Every  matrix  whose  elments  are 
real  or  complex  numbers  possesses  a  unique  A.  0  -  . 
inverse  called  "Moore  Penrose  inverse"  and  usually 
denoted  by  A+. 

A1  2~Inverse:  Every  matrix  possesses  at  least  one 
so-called  weak  generalized  inverse,  or  reflexive 
generalized  inverse. 

A1  2  3'A1  2  4-*nverse:  Every  matrix  whose  elements 

are  real  or  complex  numbers  possesses  at  least  one  A, 

L9  Z  9  6 

and  one  2  4  *nverse‘  In  the  case  of  full  row  rank 
(full  column  rank),  there  exists  a  right  (left)  inverse 
which  in  this  case  will  be  an  A.  0  , (A.  0  .)  inverse. 

In  this  chapter  we  will  try  to  investigate  the 
theory  of  different  kinds  of  generalized  inverses. 

Let  Cm  n  denote  the  vector  space  of  all  m»n  matrices 
having  complex  numbers  as  elements. 

Characterization  of  A^  Inverse 

Theorem  (1-1) :  Let  A  be  an  m*n  matrix  whose 
elements  are  real  or  complex  numbers  (i.e.,  AeCm*n) . 

The  matrix  G  of  order  n»n  is  an  A^  generalized 
inverse  of  A  if  and  only  if  X  ■  Gy  is  a  solution  for 
consistent  equation  Ax  =  y. 
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Proof :  First,  assume  that  the  equation  Ax  =  y  has 

a  solution.  This  implies  the  existance  of  a  vector  w 
such  that  Aw  =  y  .  Substituting  with  AA^A=A  we  have 

Aw=y  AA.^  (Aw)  =y  A(A^y)=y 

The  last  equation  implies  that  x=A^y  is  a  solution  of 
Ax=y. 

Second,  suppose  that  Ax  =  y  is  consistent. 

Suppose  that  there  exists  a  solution  X  such  that 
x  =  Gy  then, 

A  (Gy)  =  y 

AGAx  =  y 

AGA  =  A 

G  is  A1  inverse  of  matrix  A. 

Theorem  (1-2) ;  Let  A  be  an  m«n  matrix  whose 
elements  are  real  or  complex.  Then,  the  matrices  A^\ 
and  AA^  are  idempotents  with  the  same  rank  as  A.  Further, 
rank  A  =  trace  AjA  =  trace  AA^  . 


Proof:  Suppose  H  =  A^A  and  F  -  AA^ 


H2  -  A,  (AA^A)  =«  ala  =  H 
and  also 

F2  =  (AAXA) Al  =  AA1  =  F 

Also,  rank  A  >_  rank  H  >_  rank  AH  =  rank  A  and 

rank  A  rank  F  ^  rank  FA  =  rank  A.  These  relationships 

imply  that  rank  A  =  rank  H  =  rank  F.  Since  H  and  F 

are  idempotent,  the  rank  is  given  by  the  trace  of  each. 

From  the  equation  AA.jA  =  A^  it  is  clear  that  rank 

A^  _>■  rank  A. 

Based  on  previous  theorem  we  can  define  A^-inverse 
of  AeCm*n  as  a  matrix  A1eCn*m  such  that  (A^A)  is  an 
idempotent  and  FMA^a)  =  R  (A)  or,  alternatively  AA'  is 
idempotent  and  R(AA1)  =  R ( A) . 

Characterization  of  A^  2  Inverse 

Theorem  (1-3):  The  necessary  and  sufficient  con¬ 
dition  for  an  A^  inverse  to  be  2  inverse  of  matrix 
A(m»n)  is  that  rank  of  A^  =  rank  of  A. 

Proof:  Suppose  that  Ax  2  is  an  {1,2}  -inverse  of 
A  ,  then 
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AA  „A  -  A  ran!:  (A)  ..  rank  (A,  0) 

A1  2AAl  2  =  A1  2  rank  (A^  ^  S.  raak  (A) 
Thus,  Rank (A)  ~  rank  (A^  2> 


Conversely,  suppose  that  A,  -  is  an  {l}-inverse 

*■  f  * 

of  A  with  R ( A)  -  R(A^  2) .  This  implies  the  following 


AA^  2A  =  A 


U(A)  =  R<A^2)  =  R  ( Ax  2  A) 

In  addition,  2A  is  idompotent  using  the  previous 

definition  of  !l}-inverse,  A  is  {].  }-inverse  of  A1  2. 
This  implies  that  A^  2AA^  2  =  A1  2  anc*  fc*lis  completes 
the  proof. 

Theorem  (1-4):  Any  A1  --inverse  of  A  can  be 

I  r  z 

expressed  as 


Al,2  =  Ai^i 

~  a 

where  A ^  and  A^  are  (possibly  different)  A^-inveroes 
of  A. 

Proof ;  Tne  proof  will  be  given  in  detail  in 


Chapter  II 


Matrices  of  Full  Rank 

Theorem  (1-5) ;  A  matrix  AeCm*n  has  a  right  inverse 
Ar  if  and  only  if  rank  (A)  =  m  (full  row  rank) .  For 
a  full  row  rank  matrix  AeCm*n,  the  following  statements 
are  equivalent: 

a.  Ar  is  a  right  inverse  of  A 

b.  Ar  is  an  A^-inverse  of  A 

c.  A  is  an  A.  -  --inverse  of  A 

i  x  9  c  ,  J 

Proof:  To  prove  that  "a”  implies  "b"  and  "c", 

suppose  that  Ar  is  a  right  inverse  of  A,  i.e., 


Premultiply:  previous  relation  by  Ar  and  post- 
multiply  by  A  we  get 

ArAAr  ~  Ar»  and 

AArA  =  A  ,  respectively. 

Thus,  A  is  an  A,  -  --inverse.  Next,  suppose  that 
A  is  a  {l}-inverse  of  A.  Since  rank  A  equals  m  ,  we 
can  find  two  nonsingular  matrices  P  and  0  such  that 

PA0  =  h*0]  '  or 

A  -  P'1  [l„|  o]  D"1 
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any  { l } -inverse  can  be  written  as  follows: 


where  U  is  arbitrary. 

Substituting  in  the  first  three  axioms  of  generalized 
inverses,  we  get 


=  (P-1  fl  I  olcT1)©!*1 

m]p(P"1flJ  ( 

Jlo-1 

L  m  1  J  [u 

—  L  m 1 

J 

*  1‘1  [Jml  °]  ■  A 


Thus,  "b"  implies  "a",  and  "c".  It  is  clear  also  that 
"c"  implies  "a",  and  "b". 

Theorem  (1-6)  :  A  matrix  Aec"1*11  has  a  left  inverse 
(A^)  if  and  only  if,  rank  (A)  *  n.  For  a  full  column 
rank  matrix  AeCm*n,  the  following  statements  are 
equivalent: 


r) 


a.  A^  is  a  left  inverse  of  A 

b.  Af  is  an  A^-inverse  of  A. 

c.  A«  is  an  A.  _  .-inverse. 

t  1  $  2  t  4 

Proof :  Proof  is  similar  to  theorem  (1-5) . 

Minimum  Norm  and  Least-squares  Solution  of  Ax-y 

Theorem  (1-7) ;  Let  the  norm  of  x  e  Rn  be  defined  as 
[  |  x  |  1  =  (x*x)*5.  Let  the  equation  Ax=y  be  consistent, 
let  the  minimum  norm  solution  x  be  x  =  Gy.  Then,  G 
is  an  A,  0  .  inverse  of  A.  In  this  case,  the  minimum 
norm  solution  of  consistent  equation  Ax=y  will  be  uni¬ 
que,  although  minimum  norm  generalized  inverse  may 
not  be. 

Proof ;  To  illustrate  the  optimal  property  of 
A.  _  .  as  stated  in  the  previous  theorem,  let  us  consider 

-*•  9  9  ** 

the  following  consistent  system  of  equations 

Ax  **  b 

where 


One  can  check  that  the  following  matrices  are 
A1  2 '  A1  2  4'  A1  2  4-inverses,  respectively. 
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Theorem  (1-8) ;  Let  G  be  a  matrix  such  that  Gy 
is  a  least  squares  solution  of  the  inconsistent  equation 
Ax  =  y  for  any  yeRm  .  Then,  G  is  an  A.  0  , -inverse 
of  A  .  A  least  squares  solution  x=Gy  may  not  be  unique, 
but  minimum  | |Ax-y| |  is  unique.  If  Gy  is  a  least 
squares  solution,  then  the  class  of  least  squares  solu¬ 
tions  is  x  =  Gy  +  (I  -  GA) Z  ,  is  arbitrary. 

Proof :  Note  that  the  least  squares  solution  is  the 

wh6T6  V 

solution  of  the  consistent  system  Ax  =  y  9  1 

is  the  projection  of  y  onto  the  column  space  of  A. 

That  means  to  find  a  solution  to  the  system  Ax  =  AGy 

where  G  is  an  A^inverse  substituting  with  general 

solution,  we  get 

Ax  -  A (Gy  +  (I-G  A) Z ) 

=  AGy  +  AZ  -  AGAZ 

*  AGy  +  AZ  -  AZ 

*  AGy  =  y  . 

Notice  that  there  are  infinitely  many  least  square 
solutions.  Let  us  consider  the  following  inconsistent 
linear  system: 
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1  1  1  |  o' 

x  =  : 


i  1  1  1 


LXJ 


A  has  the  following  {1,2,3}  generalized  inverses 


r %  %' 

o 

.  i 

oM*-' 

— - 1 

o 

o 

t  L  L  • 

’  p  V‘  ’ 

i  i 

6  6 

j 

j  0  0 

1  0  0, 

_1  1  ‘ 

L_  _ 

—  J 

-  6 

These  generalized  inverses  will  give  the  following 
least  square  solutions 


Is  h 

°1 

r°  °i 

'o' 

X1  " 

0  0 

1J 

- 

;  x2=s|  h  | 

i 

_1_ 

0  0 

h 

0  0 

i 

fol 


*  h  ; 
0 


x,  = 


1  1 
6  6 


;i 

i  6 


!1 

16 


1 

6  1 


0 

1 


ft 


1 

6 


:ii 


6> 


All  have  the  same  | ( Ax  -  y|  | 


/2 


Theorem  ( 1-.9  )  :  Let  G  be  a  matrix  of  order 

n»m  such  that  Gy  is  the  minimum-norm,  least-squares 

solution  of  Ax  =  y.  Then,  G  is  the  generalized 

inverse  A,  ,  ,  .  »  A+  of  the  matrix  A. 

1,2, 3, 4 


.3 


Proof : 


The  proof  is  direct  application  on  theorems 
(1-7)  and  (1-8).  The  previous  example  shows  that  the 
last  solution  (corresponding  to  A.  ,  ,  .-inverse)  is  the 
minimum-norm,  least-squares  solution. 
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II  Computation  of  Generalised  Inverses 


Introduction 

The  main  purpose  of  this  chapter  is  to  establish 
the  techniques  to  be  used  in  the  more  general  cases  of 
computation  of  the  various  generalized  inverses  of 
matrices . 

First,  the  elements  of  matrices  considered  will  be 
real  or  complex  numbers.  Let  Cm'n  denote  the  vector  space 
of  all  m.n  matrices  having  real  or  complex  numbers  as 
elements.  Let  I  be  the  identity  matrix;  that  is, 

IreCt,r.  Let  On*neCn*n  be  the  zero  matrix,  that  is,  all 
elements  equal  zero.  Capital  letters  will  deonte  matrices. 

In  the  next  section  the  cteneral  representations  of 
and  A^  2  are  given.  Reduction  of  a  matrix  AeC^*n 
to  its  canonical  form  using  elementary  operations  is  used 


to  derive  a  qeneral  formula  for  computation  of  A^  and  A^  2 
generalized  inverses. 

In  the  following  section,  the  techniques  given  above 
will  be  modified.  A  simple  computation  technique  for 
computing  *li2f4  and  A1i2,3<4  will 

be  given.  This  technique  will  be  based  on  the  previous 
technique  after  suitable  partitioning  of  the  transformation 
matrices . 

Next,  the  basic  technique  given  above  will  be  applied 
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through  simple  examples 


The  basic  technique  previously  given  will  be  applied 
to  investigate  the  linear  system  of  equations  Ax  =  b  and 
to  find  the  general  solution,  if  it  exists.  A  comparison 
between  this  new  technique  and  the  other  techniques  will 
be  given  through  simple  examples. 

In  the  next  section,  the  technique  for  computing 
A+  using  factorization  will  be  given. 

After  that,  the  basic  technique  given  in  the  third 
section  will  be  applied  for  computing  A^  and  A2  of  a 
specified  rank. 


General  Representation  of  A^  and  A1  2 

In  this  technique,  the  Gausian  elementary  operations 
(row  and  column)  will  be  used  to  reduce  any  matrix 
AeC”**11  to  its  canonical  form.  In  other  words,  for  each 
matrix  AeCm*n  there  exists  two  nonsingular  matrices 
ReCm*m  and  CeCn*nsuch  that 


R  A  C  =  I  = 


(  2-1) 


Theorem  (2-1) 

Let  AGCin*n  with  rank  r.  Let  ReC1"*"1  and  CeCn’n  be 
nonsingular  such  that 


m 

m 

R  A  C  = 

I 

r 

K 

0 

0 

4  ► 

m 

m 

(2-2) 
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is  a  (l,2}inverse  of  A 


Proof : 

First,  we  prove  that  2  satisfies  the  first  axiom 
for  generalized  inverses  as  follows: 


A,  -A  A 


(C  frr  I  °1  R) {R"1[*r|j°|  c"1)  (C  Tr  0  R) 


with  rank  r 


Let  R  and  C  be  as  previously 


Theorem  (2-2) 

...  _m • n 
Let  AeC 

defined  in  (2-2) .  Then 


Ai  *  C 


u 


w 


R 


where  U  and  W  are  matrices  of  proper  size.  That  is, 
and  w£c(n‘r)  (",'r)  . 


Proof:  The  proof  is  trivial  by  substituting  (2- 

Axiom-1  of  generalized  inverse.  That  is: 


A  A1  A  =  (R 


-1 


K 


C-1)  (C 


=  R~ 1 

• 

Irj  K 

Xr 

U 

0  0 

0 

w 

1 

m 

R)  (R 


-1 


,-l 


,-l 


=  R 


-1 


»  i 

j 

■ 

■ 

I  I  U+KW 

I 

K 

_-l  -1 

I 

K 

r  I  _ 

r 

C  =  R 

r 

o 

o 

0 

0 

0 

"0 

<-  •  J 

- 

- 

,-l 


Theorem  (2-3) 


Let  AeC™*11  and  P,  Q  be  nonsingular  matrices  such 


that: 


PAQ  = 


where  r  is  the  rank  of  A.  Then 


■ 

■ 

I 

u 

=  0 

r 

V 

w 

- 

« 

P; 


( 2-  4) 

UECr(m“r) 


4)  into 


) 


=  A 


(2-5) 


(2-6) 
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UrCr*(m-r)  v_.c(n-r)Xr  W(.c  (n-r)  X  (m-r) 

Proof :  given  in  (2-6)  satisfies  the  first  axiom 

as  follows: 


Remark:  There  may  be  infinitely  many  A^. 

Theorem  (2-4) 

Let  AeCm  n  and  let  X  belong  to  Cn*m,  then  X  is  a 
{ 1, 2}-inverse  of  A  if  and  only  if  X  has  the  followina 
form: 

X  =  Ax  A  Al  (  2-7) 

where  A^  and  A^  are  two  (not  necessarily  different) 

{1}  -inverses  for  A. 

Proof : 

Sufficiency  proof:  if  (2.7)  holds,  then  X 
satisfies  the  first  and  second  axioms.  That  is 
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AXA  =  A  Al  AAxA  =  A  A1  A  =  A 


and 


XAX  =  AAX)A  (Aj^AAj^)  =  Ax  AA^A  ^ 

m  'i 

=  Aj  A  Ax  =  A 

Necessity  proof:  Let  us  suppose  that  X  is 
a  {1,2}  -inverse  to  A.  X  satisfies  the  second  axiom. 

That  is 

X  =  XAX 

using  first  axiom. 

X  =  X  (A  X  A)  X  (2-8) 

=  X (AYA) X (AZA) X 

where  Y  and  Z  are  two  {1}  -inverses  for  matrix  A.  X 
given  in  (2.8)  can  be  reduced  as  follows: 

X  =  XAY (AXA) ZAX 
=  (XAY)  A  (ZAX) 

=  N  A  M 

To  complete  the  proof,  it  is  sufficient  to  prove  that  N 
and  M  are  two  {1}  -inverses  for  A  as  follows 
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ANA  =  A (XAY) A  =  (AXA) YA 


r 


=  AYA  =  A 


and 


AMA  =  A(ZAX) A  =  (AZA) XA 
=  AXA  =  A. 

Theorem  (2-5) 

Let  AGCm  n  and  let  XeCn'm,  then  X  is  a  {1,2} 
of  A  if  and  only  if  X  has  the  following  form: 


X  -  0 

_Ir 

u 

V 

vu 

P,Q  are  two 

PAQ  = 

'Ir 

0 

0 

0 

P  , 


and  V, U  are  arbitrary  matrices, 


Proof:  First,  we  prove  that  X  given  in  (2.9 
{1,2}  -inverse  as  follows: 


-inverse 


(  2-9) 


(  2-10) 


)  is 
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AXA  =  (P_1  1 r  0  Q-1) (Q  Ir  U  P) (P_1  Ir  0  Q_1) 


0  0 


V  vu 


0  0 


p-1  xr  u  Xr  0  Q-1 

0  0  0  0 


P_1  Xr  0  O”1  =  A 


0  0 


(Qt1r|U  ]  P)  (P"1!  Ir|  0  1 Q-1)  (A  rr  U  P) 


IV  vu 


0  0 


V  vu 


Ir  O  II U' 

V  O  V  vt 


=  0  rr  U 

V  vu 


P  =  X 


Second,  suppose  that  X  is  { 1, 2} -inverse, then,  by  using 
theorem  (2-4)  X  can  be  expressed  as: 


~  -  I  U'  -1  I  O  -1  T  rj1' 

X  =  A,  A  A,  =  (0  r  P)  (P  x  r  u  _Q  x)  (0  r  _  P 

1  VW  oo  v*~  w" 


0  Xr_£  Irf U"  P 

v*  o  \PT~Wr~ 


o  Mr 


p 


v*U" 


That  means  that  X  has  the  form  given  in  (2-9). 


Lemma  (2-1) 


Let  XeCn*m  and  AeCm*n  and  let  X  be 


(2-11) 


then  X  is  a  { 1, 2} -inverse  of  A  . 

Proof:  Proof  is  evident  by  choosing  U  =  0  ,  and 

V  -  0  in  (2-9) 


The  Basic  Technique 

In  this  part,  the  basic  algorithm  used  for  computation 
of  generalized  inverses  will  be  established.  Let  cm*n 
denote  the  vector  space  of  all  m»n  matrices  having  real 
or  complex  elements.  Let  Ir  be  the  identity  matrix; 
that  is,  IreCr  r*  Let  0m  °£  Cn’n  be  the  zero  matrix; 
that  is,  all  elements  equal  zero.  Capital  letters  will 
denote  matrices. 

Theorem  (2-6) :  Let  AeCm*n  with  rank  r.  Let 
PeCm*m  ,  QeCn*n  be  nonsingular  matrices  such  that 


PAQ  = 


(2-12) 


L  I  J 

Consider  the  following  partitioning  for  matrices 
P  and  Q 
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p  = 


(2-13) 


where  TeC 


r*  m 


MeC 


(m-s:)  *m  .  S£Cr-  r  #  NeC»(n-r) 


Then  the  following  pair  of  matrices 


A 

'ml 

i 

pr 

0 

T* 

I 

L  n 

0 

0 

0 

M 

s 

N 

0 

- 

are  equivalent. 

Proof :  For  any  AeC™*11  there  exists  nonsingular 

matrices  PfQ  as  defined  in  (2-12).  To  show  that  the 
matrices  given  in  (2-14)are  equivalents 


p 

0 

* 

A 

I  * 
m 

"0 

0  * 

"PA 

P  ' 

f* 

0 

0 

• 

0 

• 

‘■J 

I 

L  n 

0 

0 

L 

'm. 

I 

L  n 

0 

0 

I 

m 

_ 

m 

’l 

r 

J 

t" 

PA  9 

_P 

n 

0 

M 

A 

r\ 

u 

0 

0 

• 

■ 

.S 

N 

oj 

Theorem  (2-7);  Let  AeCm*n  ,  let  P,Q,S,T,M,N  be 
matrices  defined  as  in  (2-12)  and  (2-13),  then  Aj_  2  = 
is  a  {1, 2}-inverse  of  A. 
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Proof:  Applying  Lemma  (2-i) 


Al,2  "  0 


*  fs|»] 


-  [si  4 


=  ST 


Theorem  (2-8) :  Let  the  matrices  A,P,Q,S,T,M,N 

be  matrices  defined  as  in  theorem  (2-7).  Moreover,  let 


TMl  =  0, 


(2-15) 


then 


Proof:  First,  let  X  =  ST  and  let  TM  =0.  It  is 

obvious  that  X  satisfies  the  first  and  second  axioms. 
To  complete  the  proof  it  is  sufficient  to  verify  the 
third  axiom.  That  is 


AX 


Q"1) (ST) 


(2-16) 


then 


»-L, 


Usinq  the  fact  that  Q  Q  =  *n/ 

r»  i  -j  - 


0 


*1 

0 

r 

0 

I 

u 

i 

c 

[o’^Jcf1  n]  = 


I 

0  * 

s 

r 

0 

I  V 

n-rj 

°‘ls  *  H «  °'1"  ■  [t] 


Substituting  (2—1 7)  into  (2—16)  we  can  write 


But 


Substituting  with  (2-15)  into  (2-19) 


(2-17) 


(2-18) 


(2-19) 
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pfc  := 


r  TTfc 

_ 

0 

L  0 

MM 

multiplying  both  sides  by 


(TTfc) "L 

_ 0 

0 

(MMfc) _1 

- 

we  have 


P(P 


(TTfc) "1 

0 

0 

=  I 


m 


i  .e . , 


p"1  =  pfc 

”  (TT^) _1 

0 

.  0 

(MM*)  _1  , 

Substituting  with  P-1  into  (2-18)  we  have 


AX  =  Pt 

'(TTfc)  ~l 

0 

TJr  °] 

0 

L 

(MMt) "1_ 

■ 

0 

»  Pfc 

(TTfc) -1 

o  1 

0 

0 

(MM*)"1 J 

0 

0 

=  Pfc 

r  t  -1 

(TTC)  1 

h 

0 

p 

0 

0 

• 

which  is  symmetric 


which  is  symmetric. 

Lemma  (2-1) :  Let  the  matrices  A,P,Q,S,T,M,N  be 
matrices  as  in  theorem  (2-7).  Moreover,  let 

T  Mfc  —  0 

and 


Nfc  S  =  0  hold  , 


then 


A 


t 


Al,2,3,4 


ST 


Proof;  Referring  to  theorems  (2-8)  and  (2-9) ,  it 
is  clear  that  A+  satisfies  the  four  axioms. 


Application  of  the  Basic  Technique 

The  following  examples  will  illustrate  the  computa¬ 
tion  techniques  given  in  the  .second  section.  This  tech¬ 
nique  will  be  generalized  to  the  cases  where  the  elements 
of  the  matrix  are  polynomials  in  several  parameters. 
Example  1.  Given  the  following  matrix  . 


A  =» 


(2-24) 


It  is  required  to  compute  A1,A1  2'A1  2  3'  A1  2  3  4 
(A+) .  The  first  step  in  the  process  is  to  form  the 
following  array 
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Lin  H  0  J 


100  oo 

oio  oo 

0  0  1  0  0 


(2-25) 


Elementary  row  operations  on  A  will  affect  1^ 
only  and  elementary  column  operations  on  A  will  affect 
the  columns  of  In  only.  A  obviously  does  not  have  a 
classical  inverse,  since  it  is  not  square.  The  second 
step  is  to  perform  row  elementary  operations  on  A  to  get 
the  following  array 


111 
0  0  0 
10  0 
0  10 
0  0  1 


1  01 

-1  1 

0  0 

0  0 

0  0 


(2-26) 


Next  step  is  to  perform  elementary  operations  on 
the  columns  of  A  to  get  the  following  array 


'rm 


4 


(2-28) 


This  process  is  not  unique,  however,  one  should  check 
the  following  for  possible  errors 


Now  we  can  compute  A^  ^  as  follows 


Al,2  *  ST  * 

•  — 

1 

[1  0]  = 

.1 

0 

0 

0 

0 

0 

m  * 

0 

0 

We  can  check  the  answer  to  see  that 


A  Al,2  A  =  A  • 

and 

Al, 2  A  Al, 2  =  Al, 2 

Next,  to  get  j  3  it  is  required  to  further  modify 
(2-27)  array  by  making  the  rows  of  T  orthogonal  to  rows 
of  M.  This  can  be  accomplished  by  adding  multiples  of 
<>  second  row  of  A  to  the  first  row.  In  our  example  we  will 


3  l 


add  (^)  times  the  second  row  to  the  first  one  and  this 
will  result 


Then 


Al,2,3  =  ST  = 

Y 

[“  ']  ■ 

"*5  h 

0 

0  0 

o 

1 _ 

0  0 

Sfc  m 

To  check  the  last  result 


AA1 ,2,3 


T  i  i 

r 

dr 

ill 

r 

o 

o 

t _ 

— i 

which  is  symmetric. 

Next,  when  the  first  column  of  the  array  is  made 


to  be  orthogonal  to  the  second  and  third  one  by  adding 
1/3  times  second  column  and  1/3  times  third  colum  to  the 
first,  we  have  the  following  array 


(2-29) 


3  -1  “l 


j  +1  0 


f  I  0  1 


0  0 

0  0 

0  0 


A  =  A+  = 

1,2, 3, 4 


i/31  h  hl 

1/3 


1/6  1/6 
1/6  1/6 
1/6  1/6 


Example  2:  Let 


'10  14  28  32 

8  8  4  8 

4  12  48  48 


After  performing  column  and  row  elementary  operations, 
the  array  will  be  as  follows 

1  0  0  0  0  1/4  o]  1000 

T 

0100  h  0  0100 

0000  -2  2  1  OOOOM 


0  0 

0  0 

-2  2  1 

0  1 

12  -3 

0  0 

0  1 

1  2 

22  -4 

0 

CM 

1 

o 

-23  4 

S  N 


v'1 


To  find  A.  ~  , -inverse  or  A.  „  .  inverse  using  the 
technique  given  in  theorem  (2-8)  and  (2-9) ,  we  have  to 
establish  a  technique  to  find  T  or  S  such  that  (2-15) 
or  (2-21)  holds.  That  means  to  find  A1  2  3~inverse  we 
have  to  find  T  such  that 

T  Mfc  =  0  (2-30) 

The  direct  way  to  find  T  is  to  add  a  multiple  of 
rows  of  M  matrix  such  that  (2-30)  holds  because  this  will 
not  affect  the  canonical  form  given  in  (2-14) .  In  matrix 
form  we  can  write  T  as 

T  =  T  +  K  M 

where  K  is  chosen  such  that  (2-30)  holds,  i.e., 

(T  +  K  M)  =  0  . 

This  implies  that 

K  =  -TMt(MMt)"1 

Now  the  following  theorem  is  clear. 

Theorem  (2-10);  Let  AfCm’n  ,  M,T,S,N  be  defined  as 
in  (2—34) •  Then  the  following  holds 
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S  T 


a) 

Al,2  = 

b) 

Al,2,3 

c) 

Al, 2,4 

d) 

Al,2,3 

Proof: 

-  S  T  =  S (T-TMfc (MMfc) -1M) 

=  S  T  =  (S  -  N(NtN)"1NtS)T 

A|  V 

i  *  S  T 


It  is  sufficient  to  prove  only  part  C.  From  theorem(2-9) 
it  is  sufficient  to  calculate  S  such  that 


T 

S  N  =  0 


(2-31) 


holds.  This  can  he  done  by  adding  multiples  of  columns 
of  N  to  that  of  S.  That  is, 


S  =  S  +  NK  (2-32) 


Substituting  (2-31)  we  have 

(S  +  NK)  ^'11  =  0 
(SfcN  +  KtNtN)  =  0 
Kt(NtN)  =  -SfcN 

But  since  (NtN)  is  nonsingular  matrix,  we  have 

Kfc  =  -S^nSj)-1 
K  ■  -(N+N)"1NtS 

Substituting  into  (2-32)  we  have 
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S  +  NK 


A/ 

s  = 

=  S-N  (nSj)"1  NtS 


that  is 


Al,2,4 


will  be 


Al,2,3 


S  T 

(S  -  N(NtN)“1NtS)T 
ST  -  N(NtN)"1NtST 
(S  -  N(NtN)“1NtS)T. 


To  apply  this  theorem  let  us  consider  the  example 
(  ) .  T  can  be  calculated  as  follows 


T  =  T  —  TMfc(M  Mfc) ~1M 

-  I1  0]  -  [1  0] 


-1 

1 


-  1 1  0 1  *  111  [2J  'l  [-1  1] 


[l  o]  +  [-* 


In  the  same  way  S  can  be  calculated  as  follows 


S  =  S  -  N (NfcN) ”1  NfcS 


NfcN 


-1  1 

•1  0 


0 

1 


-T  r 


-1  -1 
1  0 

0  1. 


2  1! 
-  1  2  J 
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(NtM) “ 1 


1 

3 


I1! 

■*-1 

m 

-1 

«• 

2 

-1 

' -1 

1 

0 

1 

•— 1 

- 1 

o 

- 

1 

0 

1 

3 

-1 

2 

_  1 

0 

1 J 

0 

0. 

0 

1 

.0. 

To  apply  that  to  another  example,  let  us  consider 

r-J 

example  {2  )  to  calculate  S. 


(NfcN)  _1 


" 12  0  22  -23] 

*12 

-3~ 

*1157 

-216* 

.-3  1  -4  4  J 

0 

1 

J 

22 

-4 

-216 

42 

-23 

4  _ 

r 

(NfcN) -1  - 1 


2(969) 


42  +216 

+216  1157 
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2  in 


ro  i] 

i 

O  0 

1  2 

0  -2 
•  -* 


_1 _ 

(969) 


12  -3^ 

T42 

216  fi2 

0  22  -23j 

0  1; 

0  1 

1  ! 

i  '-3 

1  -4  4  j 

0  oj 

22  -4 

2  (969) 

L216 

i 

1157j 

J 

1  2 

-23  4_ 

0-2_ 

"  -174 

4797  ~ 

• 

-62 

-6388 

557 

-626 

_442 
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The  General  Solution  of  the  System 

Ax  ~  b :  Let  AeCra  n  and  beCm  Let  P,Q,M,N,T,S 

be  defined  as  in  Section  2.2.  It  is  proved  in  Appendix  A 
theorem (A-l)  that  there  exists  a  solution  for  the  system 
Ax  =  b  if  and  only  if 


A  A^  b  =  b  . 

In  this  case  the  general  solution  is 


X  =  A,b  +  (I  -  A.  A)  Z  , 

1  n  i 

where  Z  is  arbitrary.  Using  the  special  generalized  in¬ 
verse  given  in  previous  section,,  i.e.  A^=ST  ,  we  can  check 
the  consistency  condition  of  the  system  Ax*b  and  moreover, 
we  can  find  the  general  solution  in  an  easier  way.  This 
is  clear  from  the  following  theorem. 
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and 


Theo  rent  ( - 1 1  )  :  The  equation  Ax~b  for  A<-Cm‘n 
beCm  1  ,  has  a  solution  XeCn* L  if  and  only  if 


Mb  =  0 


(2-33) 


In  this  case  the  general  solution  is  given  by 

X  =  (ST)  b  +  N ?■  (2-34) 


where  the  matrix  Z  is  arbitrary  and  the  matrices 
M,S,T,N  are  defined  as  in  Section  (2-2). 

Proof:  For  any  A  e  Cm*n  there  exists  nonsingular 

matrices  P  and  Q  such  that  the  following  holds 


PAQ 


1 


Ax  =  b  has  a  solution  x  iff 
PAX  =  Pb  has  a  solution  x  iff 

PAQQ-  x  =  Pb  has  a  solution  x  iff 

(PAO)y  -  Pb  has  a  solution  y;  x=Oy  iff 
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X 


SW+NZ 


iff 


*  .  *  f 


W]  _  fTb]  . 
0  "  Mb  ' 


Mb=0  ;  X  «  STb  +  NZ  where  Z  is  an  arbitrary  matrix  of 
appropriate  size. 

Remark :  The  solution  X  =  (ST)b  for  the  system 
Ax  -  b  is  a  particular  solution  of  Ax  =  b,  and  NZ  is  the 
general  solution  of  the  homogenious  equation  Ax=0 . 


Example 


Consider  the  system  Ax  =  b  with 


A  =  I  2 


-1  -3  3 


b  =  5 


To  find  the  general  solution  using  both  methods,  we  must 
calculate  the  generalized  inverse  first. 

Using  the  elementary  operation  S,  we  can  write  the 
following  array: 


1  0  1  0  Onl  0  0 


0 _ 1  JO _ 0  II  -2  1  0 


0  00  0  U  5  -21 


1  -21  3  -3B 


0  0  0  1 


0  0  10 


0  1-3  0 


0  T 
"0"M 


S  N  0 


To  check  the  first  step 


To  check  the  consistency  condition 


"l  0  o’ 

T 

' 

“l" 

= 

0  10 

5 

II 

5 

-5  2  0 

5 

in 

_ i 

Thus,  the  system  is  consistent  and  has  the  following 
general  solution: 
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X  =  A, b  +  (I  -  A,  A)Z 
1  n  1 


Using  the  second  method,  we  can  first  check  the  consistency 
condition  as  follows: 


The  general  solution  is 

X  *  (ST) b  +  NZ 


'  .  i 


0 

T 

< 

0 

+W 

1 

0 

1 

0 

.3  . 

ZK 

_0_ 

It  is  easy  to  see  from  previous  calculation  that 
the  second  method  is  much  simpler. 

Remark:  The  columns  of  M  matrix  are  the  basis  for 

the  null  space  of  A. 


Another  Technique  for  Computation  of  A+ 

Theorem  (2-12)  :  Let  AeCm  n  and  have  rank  k  _<  nin 
(m,n) .  Let  A  =  BC  ,  where  rank  of  B  =  rank  of  C  =  k  ; 
B  e  Cm*n  and  CeCr’n.  Then  (BtB)"1  ,  and  (C  Cfc)  "l 
exist  and 


A.  -  ^  .  =  A+  =  Cfc(C  Ct)_1  (B+B)_1  Bfc 


(2-35) 


Proof :  Substituting  in  the  four  axioms  by 

(2-35) ,  we  have 


AA+A  =  B  (C  Cfc)  (C  C*)"1  (BtB)“1(BtB)’1(BtB)C 


=  BC  =  A 


and 


'■  1  1  — 
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and  finally 

4-  t  f  —1  t  —1  +■ 

A  A  =  C  (CCT)  (B^B)  B  C 

=  ct(cct)‘1c 

Theorem  (2-13):  Let  AcCm"n,  let  Pr  Q  be  nonsinqular 
matrices  such  that 


where  B  -  P 
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Proof : 


The  purpose  of  this  section  is  to  compute  and 

of  a  specified  rank.  In  chapter  I  we  proved  that 
the  rank  of  A^  must  be  greater  than  or  equal  to  the 
rank  of  the  matrix,  i.e., 

r  £  RfA^  £  min(m,n)  . 

It  is  clear  that  rank  of  A2  will  be  less  than  or  equal 
rank  of  A. 

Now  let  AeCm'n.  Let  P  and  0  be  two  nonsingular 
matrices  such  that 


Let  P  and  0  be  partitioned  as  follows 
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,(k)  _  _k*m 


M(k)r  c(T'k)m 


<k)ecn-k  and 


where  T  e  C 

M(k)  cn(n-k)  where  0  £  k  £  min(m,n)  and  the  following 
theorem  uses  the  partitioning  given  in  (2-36)  to  compute 
A^  and  A2  of  specified  rank  k. 

Theorem  (2-13) :  Let  A£Cm*n.  Let  P  and  0  be  non¬ 
singular  matrices  such  that 


PAQ  = 


Let  T(k)  , 


_Ir 

0 

_0 

_ 

0_ 

M(k) 

/ 

(  )  .  Let  A,' 


(k) 


(k) 


and  A 


(k) 


■  00 


denote  A^  and  A2  of 


specified  rank  k,  then 


(k)  =  s (k)  T(k)  rz  k£  min(m,n) 


A(k)  s  s(k)  T(k) 


u  ^  k  r 


Al  2  =  S(r)  T(r)  =  ST 


where  S  and  T  are  given  as  in  Section  (2-3) . 

Proof :  The  proof  will  be  easy  if  we  notice  that 


(k)  T(k)  =  Q 


[hLl 

•  0|0J 


(2-37) 


To  complete  the  proof,  substitute  (2-37)  into  the  first 
and  second  conditions  of  generalized  inverse  as  follows: 
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This  implies  that 


r 


Example  (4  ) : 

Given 


A  = 


12  1 
4  2  6 

3  3  4 


s 


(3) 


~1  -2  -S/P, 

I  0  L  1/3  | 


L°  0  1  -I 


T(0) 


0 


9 


T(1)  =  [.l  0  0  J  , 


T 


(2) 


1  0  0 
_2/3  -1/6  0. 


and  finally 


1 


t(3)  = 


2/3 


0  0 
-1/6  0 


-1  -1/2  lj 


Now  the  generalized  inverses  can  he  computed  as  fol low 


A 


(0) 

2 


=  0  ,  the  trivial  solution  for  XAX=X. 


0 


0  0  °] 


10  0 
0  0  0 
0  0  0_ 
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Ill  Matrices  Over  a  Polynomial  Ring 
in  Several  Variables 


Introduction 

The  purpose  of  this  chapter  is  to  establish  an  algorithm 
to  compute  the  generalized  inverses  for  the  matrices  whose 
elements  belong  to  a  polynomial  ring  in  several  variables 
having  coefficients  which  are  complex  numbers.  In  the 
previous  chapter,  we  established  many  algorithms  to  com¬ 
pute  the  generalized  inverses  for  constant  matrices  using 
reduction  of  the  matrix  to  the  following  canonical  form: 


(3-1) 


In  the  case  of  matrices  with  polynomial  elements  in 
several  variables,  it  is  not  true  that  all  matrices  can  be 
reduced  to  t he  form  (3-1).  To  study  the  existence  of 
generalized  inverses  of  those  matrices,  we  have  to  reduce 
the  matrix  to  another  simple  form  called  the  Smith  Normal 
form.  Thus,  the  study  of  the  existence  of  the  Smith  form 
for  matrices  over  the  polynomial  ring  will  be  a  necessary 
step  to  characterize  those  matrices  which  admit  generalized 
inverses  and  to  compute  them  in  case  of  their  existence. 

In  the  next  section,  we  will  study  the  existence  of 


the  Smith  form  for  matrices  with  polynomial  elements. 
Conditions  under  which  a  matrix  over  a  polynomial  ring  is 


*  . 

V 

* 


equivalent  to  its  Smith  form  will  be  investigated.  A 
systematic  algorithm  to  reduce  a  matrix  to  its  Smith  form 
will  be  established. 

In  the  third  section/  the  necessary  and  sufficient 
conditions  for  a  matrix  over  polynomial  ring  to  have  a  cer¬ 
tain  type  of  generalized  inverse  will  be  given.  The  com¬ 
putation  of  generalized  inverses  for  matrices  with  con¬ 
stant  rank  size  having  Smith  form  as  in  (3-1),  will  be 
established. 

In  the  fourth  section,  the  case  of  variable  rank 
matrices  will  be  investigated.  The  problem  of  finding 
solutions  of  the  system  Ax  =  b  in  the  case  of.variable  rank 
matrices  will  be  discussed,  since  it  was  not  treated  before 
in  any  preceeding  work. 

Throughout  this  chapter,  C  will  denote  the  field  of 
complex  numbers.  Let  R  =  c  *  *  * '9n]  be  the  ring 

of  polynomials  in  the  variables  91,92r .  ..,9n  with  co¬ 
efficients  belonging  to  C.  Let  Cm<n(9)  be  the  vector 
space  of  all  matrices  of  order  m<n  and  their  elements 
belong  to  R. 


Reduction  to  Smith  Form 

Consider  any  m*n  matrix  A(9)e  The  Smith 

form  S  (9)  e  of  the  matrix  A(o)  is  defined  to  be 
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i 


S  (0) 


m<n 


S(9) 

S  (0) 


^diag 

kk 

!  1  o] 

;  m<n 

^diag 

k<0>' 

|] 

;  m=n 

J"diag 

k‘0f 

;  m>n 

L  o  ■ 

i 

where  ei(9)  are  the  invarient  polynomials  over  R  of  A (9) 
given  by 


d.  (9) 

e.  (9)  =  -r1 - (i-l,2,...,min(m,n)) 

i 


where  dQ(e)~  1  and  d ±(Q)  is  the  greatest  common  divisor 
(g.c.d.)  of  all  the  ith  order  minors  of  A(9). 

Example  (  3-]) 

Given 


A  ( X) 


1  X-l  X+2 

2  2 

X  X^  X*+2X 

X-2  X2-3X+2  X2+X-3 


then,  one  can  compute  the  following 
dQ(X)  A  l  , 
dx (X)  =  1  , 

d2(X)  =  1  , 

d3  (X)  -  X(X+1), 

and  the  invariant  factors  of  A(X)  are  as  follows: 
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eL(X)  =  y  =  1, 
e2(X)  =  y  =  1, 


and 

e3(X)  =  =  X  (X+l)  . 


Thus  the  Smith  form  S(X)  associated  with  A(X)  is 


S  (X ) 


10  0 

0  1  0 

0  0  X(X+1) 


Example  (3-2) 
For 


A(s,z)  = 


"s  0  l‘ 

0  sz+1  i 

0  0  z 


d1(s,z)  = 
Thus  the  Smith 


d2(sfz)  ■  1  ,  d3(S/Z)  =  sz(sz+l). 
form  associated  with  A(sfz)  is 


S(s,z) 


10  0 

0  1  0 

0  0  sz (sz+1) 
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Given 


A(s,z)  = 


s+1  l+z(s+l)  0 
s  sz+1  -(s+1) (s+z) 
o  s(s+l)  (s+1) 
s+1  l+z(s+l)-(s+l)  (s+2) 


(s+l)z  z2 
sz  z2 
s(s+l)  sz 
(s+l)z  z2 


then,  one  can  compute  the  following  d^(s,z)  = 
d2(s,z)  =  1,  d3(s,z)  =  l(s+l) ,  and  d^(s,z)  =  s(s+l) 
Thus,  the  diagonal  elements  are 
e1(s,z)  =  e2(s+z)  =  1, 


e3  (s,z) 


(s+1) , 


and 


e4(s,z)» 


S (s+1) 2 (s+z) 
(s+1) 


S (s+1)  (s+z)  . 


Finally,  the  Smith  form  S(s,z)  of  A(s,z)  is 


1 


0 


0 


0 


0^ 


s  (s,  z) 


0  10  0  0 

0  0  (s+1)  0  0 

0  0  0  S(s+l)(s+z)  0 


(s+1) 


(s+z) . 
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Equivalence  of  a  Matrix  over  R  to  Its  Smith  Form 

A  matrix  A (9)  e  Cn"n(0)  is  said  to  be  unimodular 
if  | A ( 0 ) |  is  nonzero  complex  number.  It  is  clear  that 
any  unimodular  A(0)  eCn*n(0)  has  inverse  A-1(8)  which  be¬ 
longs  to  Cn*n(8).  Let  A (6 }  and  B(0)  be  matrices  which 
belong  to  Cm>n(0),  then  A(0)  and  B(9)  are  said  to  be 
equivalent  over  R  if  and  only  if  there  exists  two 
unimodular  matrices  P(0)e  CnL*m(0)  and  Q< 0 )  e  Cn*n(0)  such 
that: 


P  (0 )  A (0 )  0(0)  =  B (0 )  .  (3-2) 

It  is  easy  to  show  that  equivalent  matrices  over 

have  the  same  Smith  form  over  R.  The 
converse  of  this  fact  is  true  in  the  case  of  matrices 
over  the  field  R  in  one  variable  since  A(9)e  Cm‘n(0)  in 
one  variable  is  always  equivalent  to  its  Smith  form. 

Although  it  is  true  that  two  matrices  A(e),  B(0) 
are  equivalent  over  s  «  R  £zj  (ring  of  polynomial  in 
variable)  if  and  only  if  they  have  the  same  Smith  form, 
this  may  not  be  true  for  the  case  of  matrices  over 
R  *  cjj9i''®2r  *  *  *®r]  (polynomials  of  more  than  one  variable). 
The  following  two  examples  illustrate  this  fact  over  the 
field  R  =  c£s,z]  (see  Frost  and  Storey  ). 


=  cfe^i 
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The  following  matrices 


s+z 

0 

z"1 

s+z 

0 

l" 

0 

s+z 

0 

,  B (s,  z)  = 

0 

s+z 

0 

0 

0 

s 

0 

0 

s 

have  the  same  Smith  form  S(s,2) 


S (s,z) 


0  0 
s+z  0 
0  s(s+z) 


but  there  is  no  transformation  of  equivalence  over 
R  ■  C  £s,z]  such  that  13.2)  holds.  Thus,  A(s,z)  and 
B(s,z)  are  not  equivalent.  This  is  clear  since  they 
have  different  rank  at  s=2*0. 


Example  ( 3  -5 )  : 

The  following  matrices  are  not  equivalent  although 
they  have  the  same  Smith  form: 


S  0  1 

S  0  0 

A (s, z)  = 

0  sz+1  I 

,  B (s,z)  » 

0  sz+1  1 

0  0  z 

_0  0  Z_ 
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Their  Smith  form  is 


S (s,z) 


10  0 
0  10 

0  0  SZ (sz+1) 


Frost  and  Storey  ^8]  ,  ^9]  investigated  the 

sufficient  and  necessary  conditions  for  a  matrix 
A ( 0)  eCm’n(0)  in  two  variables  to  be  equivalent  to  its 
Smith  form.  These  conditions  involve  a  new  concept  called 
zeros  of  a  matrix  over  R. 


Zeros  of  a  Matrix  over  R 

For  a  matrix  over  R  £sj  it  is  certainly  the  case 
that  if  the  determinantal  divisor  d^s)  is  removed  from 
all  the  ith-order  minors,  then  the  remaining  polynomials 
cannot  be  simultaneously  zero  for  any  value  of  S.  This 
result  does  not  extend  for  matrices  over  r£s,J  . 

It  is  quite  possible  that  on  removal  of  the  deter¬ 
minantal  divsor  d^SjZ)  from  all  the  ith-order  minors  of 
a  matrix  A(s,z)  over  r|s,z]  ,  the  remaining  polynomials 
may  all  be  simultaneously  zero  for  one  or  more  values 
of  the  pair  (s,z).  Such  value  of  (s,z)  will  be  called 
an  ith-order  zero  of  A(s,z). 
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Example  (3.6) : 

Let  us  consider  the  same  matrices  given  in  Example 
(3.4).  A(s,z)  has  zeros  of  first  and  second  order  at 
(0,0)  while  B(s,z)  has  no  zero  of  any  order. 


Example  (3.7) : 

Let  us  consider  the  same  matrices  given  in  (3.5). 

A (s, z)  has  no  zero  of  any  order  while  B(s,z)  has  zero 
of  second  order  at  (0,0). 

It  is  important  to  note  that  the  Smith  form  S(s,z) 
has  no  zero  of  any  order.  To  prove  that,  it  is  quite 

L  U 

sufficient  to  note  that  the  determinant  of  i  principal 
minor  is  d^(s,z)  as  follows: 


determinant  of  ith  principal  minor 

=  e,  (s, z)  •  e-(s,z),...e. (s,z)  =  dl(s'z) *  d2(s'z) .... 

1  d^SjZ) 

...  di(s,z) 
di_i (s, z) 


_  d± ( s , z ) . 


The  Conditions  for  Equivalence  Over  R[s,a 

It  is  important  to  note  that  a  transformation  of 
equivalence  over  r[s,z|  preserves  the  zeros  of  a  matrix 
over  R^s,zj  .  From  this  remark  it  is  clear  that  matrices 
over  R^s,zj  having  the  same  Smith  form  over  R^s,zj 
but  not  having  the  same  zeroes  are  not  equivalent  over 
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Applying  this  fact  to  matrices  A(s,z)  and  B(s,z) 


R  ^s ,  z j  . 

given  in  example  (3.4),  it  is  clear  that  both  matrices 
A(s,z)  and  B(s,z)  do  not  have  the  same  zeros;.  Since 
Smith  form  over  R^s,zJ  has  no  zeros,  it  is  clear  that 
a  matrix  over  R^s,zj  which  has  zeros  cannot  be  equiva¬ 
lent  to  its  Smith  form  over  R^s,zj"  . 

As  an  example,  note  that  the  following  matrices 


s+z 

0 

•— 

z 

s 

0 

— 

0 

A(s,z)  = 

0 

s+z 

0 

,  B(s,Z)  = 

0 

sz+1 

1 

0 

0 

s 

0 

0 

z 

are  not  equivalent  to  their  Smith  form  because  A(s,z)  has 
zero  of  order  1  and  2  at  (0,0);  and  B(s,z)  has  a  zero  of 
second  order  at  (0,0) . 

An  example  of  matrices  over  R  £s,zj  which  are 
equivalent  to  their  Smith  form  is  as  follows: 


s+z 

0 

1 

s 

0 

r 

A(s,z)  = 

0 

s+z 

0 

t  B(s,z)  — 

0 

sz+1 

1 

_  0 

0 

s 

0 

0 

z_ 

Note  that  both  matrices  have  no  zeros  of  any  order. 

Lee  and  Zak[  19  ]  proved  that  lack  of  zeros  of 

any  order  is  not  a  sufficient  condition  for  the  equiva 
lence  of  A(s,z)  over  R^s,zj  to  its  Smith  form.  For 
example  the  matrix 


-s-1 


A(s,z)  = 


2 

-s  z 


has  Smith  form 


S (s,z)  = 


0  z2-s2(s+l) 


but  A(s,z)  is  not  equivalent  to  S(s,z)  although  A(s,z) 


has  no  zeroes  of  any  order. 


Construction  of  Transformations  of  Equivalence 

In  the  previous  part  we  have  investigated  the 
equivalence  of  a  matrix  to  its  Smith  form  over  the  ring 
of  polynomials  in  several  variables.  The  question  now 
is  how  to  construct  transformation  of  the  form  (3,2)  which 
reduces  the  matrix  A(0)e  Cm’n(9)  to  its  Smith  form  S(e). 
That  is,  to  find  unimodular  matrices  P(0)  and  0(0)  such 
that 


P(0)  A (0 )  0(0)  =  S(0) 


(3-3) 


As  we  did  in  Chapter  II,  P(0)  and  0(0)  can  be  con¬ 
structed  using  elementary  row  and  column  operations,  if  one 
modifies  the  standard  definitions  of  elementary  operations 
to  include: 


1)  Multiplying  of  a  row  (column)  by  an  arbitrary 
nonzero  scalar  constant, 

2)  adding  to  a  row  (column)  the  elements  of  another 
row  (column)  multiplied  by  an  arbitrary  polynomial  which 
belongs  to  the  ring  R  02,.. . .  ,0  J  ,  and 

3)  interchanging  of  two  rows  (columns) . 

The  construction  of  matrices  P(9)  and  Q(9)  is,  in 
general,  a  difficult  step.  Frost  and  Storey 
suggested  a  systematic  procedure  to  reduce  matrix  A(s,z) 
over  the  ring  R  [s,z]  to  its  equivalent  Smith  form  using 
elementary  transformations.  The  first  step  in  this  pro¬ 
cedure  is  to  bring  the  matrix  A(s,z)  via  a  transforma¬ 
tion  of  equivalence  over  R^s,z|  to  the  form 

e^s^)  it  (s,z) 


where  e1(s,z)  is  the  first  invariant  polynomial  of 
A(s,z),  and  A'  (s,z)  has  the  form 


A' 


1 


0 


Al(s,z)J 


It  follows  that  A(s,z)  is  equivalent  over  r£s,z]  to 
a  matrix  of  the  form 

e1(s,z)  0 

0  e^s^JA^s^) 
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By  repeating  this  procedure  for  the  matrix  e^(s,z)A^(s,z), 
we  can  find  a  transformation  which  reduces  e^(s,z)A^(s,z) 
to  the  form 


e,(s,z)  A" (s, z)  , 


where  A"(s,z)  has  the  form 

’l  0 

A" (s, z) 

0  A-(s,z) 


That  means  A(s,z)  is  equivalent  over  r|s,z]  to 


e^  (s, z)  0  0 

0  e2(s,z)  0 

L  0  0  e2  (sfz)  A2  (s,z)J 


This  procedure  can  be  successfully  repeated  until 
A(s,z)  has  been  brought  to  its  Smith  form  under  condi¬ 
tion  A(s,t)  is  equivalent  to  its  Smith  form.  This 
procedure  will  be  explained  in  the  next  example. 


Example  (3-8)  : 
Given 
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A(s,z)  = 


s+1 

1+z (s+1) 

0 

(a+1)  z 

z2 

s 

sz  +  1 

- (s+1) (s+z) 

sz 

z2 

0 

s  (s+1) 

S+1 

s (s+1) 

sz2  (s+1) 

s+1 

1+z (s+1) 

- (s+1) (s+z) 

(s+1)  z 

z2 

one  can  compute 
e1(s,z)  =  1 


A ( s / z )  is  brought  using 

fl  -1 


and 


P1  * 


0,  = 


-s  1+s 

0  0 

-1  0 


-z 

1 

0 

0 

0 


0 

0 

1 

0 


0 

0 

0 

1 


(s+1) (s+z) 
0 
1 
0 
0 


-z 

0 

0 

1 

0 


to  the  form 


[o  A^  ( s ,  z  )J 


where 
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Aj^SfZ) 


'  1  -(s+l)2(s+z) 

s(s+l)  s+1 

0  *■  { s+1)  ( s+z ) 


s(s+l)  sz  (s+1) 
0  0 


Now  e2(s,z)  =  1. 

Again,  A^SjZ)  is  brought  using 


p2 


1  0  0 

-s(s+l)  1  0 

0  0  lj 


and 


°2 


1 

0 

0 

0 


(s+1) 2 (s+z)  0 

1  0 

0  1 

0  0 


to  the  form 

f1  °  1 

0  1  A2(s,z)J 


where 
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(s+1)  *-  (1+s  (s+l)  2  (s  +  z)  ) 


1  A2(s/z)  = 


-  (s+l) (s+z) 


Then,  using 


1  s(s+l)2 

jjs  +  z)  1+s  (s  +  l)  2  (s  +  z) 


and 


1 


°3 


0 

0 


-s 

1 

0 


0 

0 

1 


A2  is  brought  to  the  form 


s+l  0  0 

0  s(s+z) (s+l)  0 

_  m 


s(s+l)  0 

0  0 


Finally,  A(s,z)  is  reduced  to  the  Smith  form 


The  main  purpose  of  this  section  is  to  establish  an 
algorithm  to  compute  different  generalized  inverses  of  a 
given  matrix  A(e)  e  (^'"(Q).  Throughout  this  section, 
it  is  assumed  that  a  matrix  A (8)  is  equivalent  to  its 
Smith  form;  i.e.,  to  the  following  form: 

(3-4) 

where  n(0)  is  a  diagonal  matrix  which  belongs  to 
c(m  r)  (n  r)  ^  ^  That  is,  there  exist  two  unimodular 
matrices  P(8),  0(8)  such  that 


P  (0 )  A(0)  0(0) 


0  SH0) 


(3-5) 


The  next  theorem  establishes  the  necessary  and  sufficient 
conditions  for  a  matrix  A(0)e  Cm*n(0)  to  have  generalized 

inverses  '  Al,2,3^  •••••  etc* 

Theorem  (3.1);  Sontag  [27] 

Let  A (0 )  e  Cm  *n  (0 )  ,  then  A(0)  has  Aj_  2  -  inverse 
which  belongs  to  Cn*m(0)  if  and  only  if  A (0)  has  a 
constant  rank  r  <_  min(m,n)  . 


Proof :  Suppose  that  A(0)  has  constant  rank  r, 

then  A ( 0 )  can  be  reduced  to  the  form  (3-1)  using  ele¬ 
mentary  transformation  as  follows: 


P  (0 )  A (0 )  0(0) 


(  3-6) 


where  P(0)  and  0(0)  are  unimodular  matrices,  let 


X 


P(0) 


It  is  clear  that  X  is  an  A1>2(0)  inverse  of  A(9). 
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Remark 


By  proper  partitioning  of  the  matrices  P(6)  and  0(8) 
given  in  (3-6)  in  the  same  way  as  in  the  case  of  constant 
matrices,  i.e.. 


P(0) 


T(0) 

.mTST 


0(8)  «  f  s  (6  )  |  N  ( 8  )  J 


then. 


Al,2(9)  *  S(9)  T(9) 


Remark 

In  the  case  of  full  raw  rank  matrix  A(8)e  C111  ,n ( 0 )  , 
i.e.,  rank  of  A(0)  =  m 


Al,2,3 


(9) 


Remark 

In  the  case  of  full  column  rank  matrix  A(0)e  Cm"*n(0), 
i.e.,  rank  of  A(0)  *  n 


(0)  *  A 


1,2,4 


(0) 
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Let  A (9)  be  a  polynomial  matrix  which  belongs  to 
Amxn ^ 0 )  #  Let  p(Q)  an<j  Q(9)  be  unimodular  matrices  such 
that 


P(e)  A(0)  0(9) 


fr. 


where  (2(9)  e  C^m  (0)  ,  then  there  exists  A2(9) 

with  rank  k  <_  r  in  the  following  form 


A2(0)  =  0(9) 


P  (9)  . 


(3-7) 


By  proper  partitioning  of  P(9),  0(9)  as  follows 


P  (9 ) 


T(9)' 
M  ( 0 ) 


A ( 0 )  =  [s  (0 )  |  N (0 )j 


where  T(0)  e  Ck,n,(9)  ,  M(0)  e  C (m_k)  *m(0)  , 

S(0)  e  Cn*k(0)  ,  and  N(0)  e  Cn* (n"k)  , 


A2(9)  can  be  written  as 


(3-8) 


A2(9)  *  S (0)  T ( 0 ) 


(3-9) 


Proof : 


0(0) 


=  0(0) 


Flk  0 


1^1° 

0  0 


- 

X. 

m 

0 

0 

0(8) 

P(0) 


XJL 

o  To.  o  i 


(0) 


fk__i 

O  0 


=  0(0)  Ik[  0  P  ( 0 )  =  A_(Q) 
0  0.  * 


This  proves  the  first  part;  to  prove  the  second  part, 
substitute  (  3-8)  into  the  relation  (3-^  as  follows 


a2(9) 


xk  0 


=  0(0) 


=  ^(0)|ni 


»  S(0)  T  (0)  . 


Xk  0  T  (0 ) 

0  OJLm(0). 


■  s(8,i°]  pHfi 


In  the  following  part  we  will  demonstrate  the  exis¬ 
tence  of  different  types  of  generalized  inverses  over 

R  *  c  [ei'e2'"*'0n]  * 


Example  (3  -9)  : 
Given 


I 


following  matrix 


One  can  check  the  answer  by  substituting  in  the  first 
three  axioms  (  )  as  follows: 


s  sz  10 

^  ^1  2  3^  =  2 

-sz  1-z4*  0  1 

-10  0  0 


0  0  -1 


0  0  0 


1  0  s' 


-sz  1-z' 


-1  0 


0  1  -sz 


=  -sz  l-z‘  0  1 

-10  0  0 


0  sz 


0  1— z  * 


sz  1  0 


=  l-sz  1-z 


-1  0  0  0 


also 


Al,2,3  A  Al,2,3 


10  0  -1 


0  0  0 


1  0  s' 


0  1  -sz 


s‘  sz  1  0 


-sz  1-z  0  1 


_-l  0  0  0J  1  0  s 

0  1- 


0  0- 


0  0  0 


0  0 


0  0  0  0 


0  sz  1  0  1  0  sz 
0  1-z2  01  0  1  -sz 


0  0-1 


‘1,2, 


and  finally 


A  A 


1/2/3 


2 

“  — 

s  sz  1  0 

0  0-1 

-sz  1-z2  0  1 

0  0  0 

-10  0  0 

0  1  -sz 

J 

-  j 

10  0 
0  10 
0  0  1 


Also,  we  compute  A2(9)  with  rank  =  1  as  follows: 


A2(0)  - 


[l  0  s2] 


0 

0 

0 

0 


and  finally  we  compute  A2(8)  with  rank  =  2  as  follows: 


A2(9) 


r 

r  2 

r*  “ 

0  0 

10s 

0  0  0 

0  0 

0  1  -sz. 

0  0  0 

1  °. 

10  s2 

0  1 

0  1  -sz 

•  « 

•  m 

To  compute  A^  2  34(0),  it  is  necessary  to  satisfy 
the  relation 

Sfc  N-0. 
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It  is  the  case  of  nonconstant  rank  so  we  can  construct 
only  A2(s#z)  with  rank  =  1  as  follows 


A2 (s,z) 


0]  1  0  0 


0  0  0 
0  0  0 


10  0 


Example  (3-11) 


Given 


then  one  can  construct  the  following  arr?y 


and  A- (9)  with  rank  *  2  is 


■ 

K 

■ 

Example  (3-12} 

For  the  matrix 


A (S / Z ) 


S  Z  S2+Z+l 

2  2  3  2 

S  Z  SZ  S  Z+SZ^+SZ 


we  can  construct  the  array 


— 

1  0 

0 

1  0 

0  0 

0 

-SZ  1 

-S  SZ 

l+S^ 

-1  1+Z 

z 

1  -z 

-s 

m 

•  « 

-s 

[l  o] 

A12(Z)  = 

-1 

» 

1 

»  < 

— 

0 

0 

0 


The  Matrices  with  Variable  Rank 

For  A (8)  e  Cm*n(8)  and  A (8)  does  not  have  constant 
rank#  the  problem  of  finding  solutions  of  A(6)  x(9)  ■  b(8) 


8 


■i 


arises  since  it  was  not  treated  by  Sontag  2 7j  and  others. 
Throughout  the  next  section  it  will  be  assumed  that  there 
exist  unimodular  matrices  P(0),  0(9)  belonging  to 
Cm’m(0)  ,  Cn‘n(6) ,  respectively,  such  that: 


P  (6)  A ( 0 )  0(0) 


fl(6) 


=  V9) 


(3-10) 


where  AQ{9)  is  the  Smith  form  of  A(0).  Such  matrices  were 
treated  by  Frost  and  Storey[8,9  j,  and  Lee  and  Zak[  19  j, 
when  matrices  were  reduced  to  their  equivalent  Smith  form: 


Theorem (t-t) : 

Let  A ( 6 )  e  Cm*n{9)  ,  b(9)  e  Cm'1(8)  ,  and  A(0) 
has  nonconstant  rank  k(6),  i.e.. 


1  <  r  <_  k(9)  min(m,n)  . 

Let  P(9)  ,0(9)  be  unimodular  matrices  such  that: 


P(0)  A ( 9 )  0(9) 


0(0) 


L  »  J 

then  the  following  partitioning  of  matrices 


A  (9) 

Im 

/ 

_Ir 

0 

TO)  " 

0 

TO. 

.  MO) 

Lxn 

0 

so; 

N(9; 

0 

(3-11) 


are  equivalent  over  R  =  C 


t  •  •  •  t 
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Proof :  Proof  is  similar  to  constant  case  of  A. 


Theorem  (3-4): 

Let  A(e)  e  Cm#n(e) /  and  A(9)  have  nonconstant 
rank,  i.e., 

1  <  r  £  rank  of  A(e)  <,  min  (n,m)  , 
and  the  hypothesis  of  theorem  (3-3)  holds. 

Then  the  following  set  of  equations 

A(e)  X  (0 )  =  b(6)  (3-12) 

has  a  solution  X(0)e  Cn’^(0)  if 


M ( 6 )  b(e)  =  Q(e)  zc©)  ,  (3-13) 

for  some  Z(0)e  C * 1 (9) ,  and  in  which  case  the  general 
solution  X(9)  is  given  by 


X ( 9 )  *  S (9)  T ( 0 )  b(9)  +  N (0 )  Z(9)  (3-14) 


where  S(0),  T(9),  N(9),  n(0)  are  given  as  in  (3-11) 

Proof:  For  any  A(0)  e  C^’^O)  there  exist  unimodular 
matrices  such  that 


P(0)  A (0)  0(0) 


■ 

Jr 

0 

0 

0(0) 

<3-15) 


where  P(6)  e  Cn'*m(9),  and  0(0)  e  Cn’n(9).  A(6)  X(0)  -  b(e) 


has  a  solution  X(9)  iff 


P  (9 )  A  (0 )  X<9)  =  P(9)  b  (0  )  has 
(P  (0 )  A  (9 )  0(0))  0_1  (6 )  X  (0  )  = 

y(9)  =  P (0 )  b (9 )  has 


• 

Tr 

0 

0 

(2(9) 

a  solution  X(9)  iff 
P ( 0 )  b ( 0 )  has  a  solution  X(9)  iff 

a  solution  y (9) (9)  >OC(9)  iff 


y  (0) 


mi 

2(0) 


x ( 0 )  =  0(0)  y (0) 


N  (9 ) 


The  last  set  of  equations  can  be  written  as 
W(0)  =  T (9 )  b ( 0 )  ;  (2(9)  Z  (9 )  =  M(0)  b(9)  ; 

X  (0 )  =  S  (0)  W ( 0 )  +  N  (0 )  Z  (0 )  . 

Thus,  the  solution  of  A(Q)  X(Q)  *  b(0)  will  be: 

X(0)  =  S(0)  T (0)  b (9)  +  N (9 )  Z  (0) 
in  the  condition  that 

(2(9)  Z(0)  =  M ( 0 )  b(9),  holds  for  some  Z(0)eC*n  r^#1(0) 
with  appropriate  size. 


Example  (3-13) 

Consider  the  following  system 


» 

1-X 

s 

X 

0 

X(S,X,3  - 

X(S+Z) 

z 

sz-x 

0 

X 

x^+szx 

Using  the  elementary  operations  we  can  construct 
the  following  array: 
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To  check  the  consistency  condition,  one  may  cal 
culate 


X (s+z) 
X2+SZX 


that  is 


X  a(9)  *  -ZX(s+z)  +  X2+SZX,  and  d(9)  ,  c(9)  are 
arbitrary  polynomial. 


z  (0)  « 

-Z (s+z) +X+SZ 

d  (9) 

_ 

x-z2 

d  (9) 

.  c(0) 

c  (9) 

*  mJ 

The  general  solution  will  be 


X  =  S  T  b  +  NZ 


hi 


X 


X  (s+z) 


f-xz  1  fxs  1  f  0  1 


The  main  purpose  of  this  part  is  to  establish 
methods  to  solve  algebraic  Riccati  and  Lyapunov  equa¬ 


tions.  The  Lyapunov,  equation 

AX-XB  =  C  (  4-1) 

and  the  Riccati  matrix  equation 

AX-XB  +  xDx  =  C  (4-2) 

where  the  matrices  A,  B,  C,  and  D  have  elements  which  belong 
to  the  field  of  complex  numbers. 

Riccati  and  Lyapunov  equations  are  very  important 
to  establish  methods  to  enable  any  one  to  do  systems 
decomposition,  i.e.,  transform  large  systems  to  uncoupled 
small  subsystems.  Such  process  requires  the  solution 
of  the  equations  (4-1)  or  (4-2)  . 

At  first  the  elements  of  all  matrices  are  real  or 
complex  numbers.  The  notion  of  strong  similarity  of  the 
following  pair  of  matrices: 

[H4] 


and 
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Equation  (4-5)  holds  whenever  the  equation  -AX+XB+C=0 
has  a  solution  (i.e.,  equation  (4-1)  ). 

For  applying  to  systems  decomposition  consider  the 
following  differential  system  of  equations. 


dx  (t) 
dt 


#* 

A 

• 

c 

0 

B 

■ 

x  (t) 


(4-6) 


let 

x(t) 


y  (t) 


where 


AX  -  XB  -  C 


(4 


This  system  is  reduced  to 


f 


So  the  systems  (4-6)  and  (4-3)  are  strongly  similar  under  the 
condition  that  (4-1)  holds- System  (4-8)  is  uncoupled  sub¬ 
systems  . 

Now  consider  the  fully  coupled  differential  system  of 
equations 


dx(t)  [b  Id 

dt  C  A 

•  • 


x  (t) 


applying  the  following  transformation  to  (4-9) 


(4-9) 


x(t) 


(4-10) 


where  X  is  a  solution  of  (4-2)  with  matrices  coefficient 
defined  as  in  (4-9) . 


dv(t) 

ifo' 

■  - 

B  D 

i_l 

* 

_0 

dt 

x|l 

m  «■ 

C  Aj 

FI 

I 

• 

y  (t) 


- 

- 

• 

" 

B 

D 

±- 

.CL 

xb+c 

XD+A 

X 

1 

- 

- 

m 

j 

■ 

| 

= 

B-DX 

_ D 

XB+C-XDX-AX 

XD+A 

L 

j 

» 

■ 

1 

B-DX 

L-  D 

/  \ 

0 

1  XD+A  i 

X  v  w 

M 

i 

J 

(4-11) 


Equation  (4-11)  represents  a  partially  coupled  system 
instead  of  ( 4-9)  . 


Applying  another  transformation  as  before  using 
Lyapounov  equation  (4-1),  the  system  will  be  reduced  to 


dz ( t)  = 

B-DX 

• 

0 

dt 

0 

» 

A+XD 

In  the  next  two  parts  we  will  establish  the  different 
techniques  and  approaches  used  to  solve  equations (4-1) 
and  (4-2) . 


Solving  Lyapunov  Equation 

Consider  the  linear  system  represented  by 


dx 

dt 


Rx 


(4-12) 


where 


R  = 


AlC 


OB 


(4-13) 


where  A,  B,  C  are  n*n  matrices  whose  elements  are  complex 

rmo’ 


numbers.  The  matrix  R  is  similar  to  R*  = 


0|  B 


whenever 


the  equation  AX  -  XB  =  C  has  a  solution  X.  In  this  case 
it  is  clear  that 


TRT 


-1 


R* 


(4-14) 


where 
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■'«  ‘  ■  *»  -•-» 


T  =  [-?-*]  (4-15) 

where  X  is  a  solution  for  Lyapounov  equation  (4-1) . 

First  Technique.  Theorem  (4-1) :  Roth  (  ) 

A  necessary  and  sufficient  condition  that  equation 
(4-1)  has  a  solution  X  where  the  matrices  A,  B,  C  are 
square  matrices  of  order  n»n  with  the  elements  in  the  field 
of  complex  numbers  is  that  the  matrices  R  and  R*  given 
in  (4-13)  are  similar. 

Letting  f- ( \)  and  f D { X)  be  the  characteristic 
polynomial  of  A, B  respectively  then 

fO‘M]  [n'mI 

VR)  =  L5urJ  '  VR)  -  IfoJ  (4-16) 


Proof: 

The  first  part  Pf  the  theorem  is  clear  using  the 
following  equation 


[i  i  xl 

\i  !• 

-xl 

[ A  -AX+XB+c] 

Ifi 

R  1 

“  L°  i  B  "J 

is  a 

solution 

of 

(4-1)  then 

[i  >  xl 

p_ 

lo  L 

R 

Lo 

d 

a  Lo  i  =r* 

R  = 

p 

X  :  C 

1  f1  ;  xl 

[o  ■  ij 

l 

3  ’  E 

v  Lo  d 

(4-17) 


(4-18) 

(4-19) 
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MIA. 


microcopy  resolution  test  chart 

NATIONAL  BUREAU  OF  STANDARDS- 1963 -A 


(4-20) 


fA(R)  = 

:xi  °  1  p  •  xi 

L»  ij  °  *a(B)  L°  Ij 

.  J 

fA(R)  - 

o  j  -x^sil  , 

0  i  Ml 

L°  1  £a(b,J 

0  i  uj 

This  implies  that 

M  -  -XU 

In  the  same  way 

£„(»)  -ri.:-x1 1 

fB(A)|  0 

L 

0  Ij  | 

0  ffi(B | 

lo  j  ij 

pB(A)  |  fB(A)  _  x] 

fm  « 

/V  A  • 

g  N  i  M 

r  i 

O'  J 

L°  °. 

This  implies 


M  «  N«X 


Theorem  (4-2) 

The  equation  (4-1)  has  a  solution  X  if  the 
pair  of  equations  has  a  common  solution: 

M  +  XU  «  0 


(4-21) 


(4-22) 


(4-23) 


following 


(4-24) 
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(4-25) 


M  -  NX  =  0 

A  /V 

where  M,  U,  M  and  N  are  as  given  in  (4-16) .  Moreover, 
any  common  solution  will  be  a  solution  of  (4-1) . 

The  necessary  and  sufficient  condition  that  (4-1) 
has  a  solution  X  is  that  the  equations 

M  U“U  «  M 

and 

A  /\  A  A 

N  N~  •  M  a  M 

and 

M  U  =-N  m  hold. 

In  this  case  the  solution  will  be  expressed  as 

X  =  N~  •  M  -  M  U“  +  N-  •  N  M  U~  (4-29) 

Proofs 

The  first  part  of  the  theorem  is  clear  using  (4-22) 
and  (4-23)  in  theorem  (4-1) . 

Equations  (4-26)  and  (4-27)  are  the  consistency  con¬ 
dition  of  each  of  the  pair  of  the  equations  and  equation 
(4-28)  is  the  condition  that  the  two  equations  have  a 
common  solution. 

Example  (4-1) t  Solve  the  Lyapounov  equation  (4-1) 

for 


(4-26) 

(4-27) 

(4-28) 
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Check  of  consistency  conditions 


M  U“U  »  M 


A  solution  is 

* 

X  -  W  ~M  -  M  u”  +  N  ~ •  N  M  if" 

■MG  J-c :j 

•G  :1M[:  :3[;  a  ■[:  a 

Moreover,  we  can  find  the  general  solution  by  finding 
the  general  solution  for  each  equation.  The  general 
solution  for  the  first  equation  is* 
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pi  m 


X1  * 


0  -  M  D“  +  Jjll  -  0  f) 


(4-30) 


-  c  t  g  a  *  4  *J  -  c  a  g  :J) 


h  -a  i 

I  l-m  m_ 


where  L  ,m  are  arbitrary. 


A  A 


[-  -  N  *  M  -  (I-N”N)Y, 

i  4 


(4-31) 


r  i 

Lo  oJ 


2  2 
2  2 


*l 

0 


h  o 
o  o 


2  0 
2 


i  i 

X  < 


where  £' ,m'  are  arbitrary. 
Equating  X^  *  X2 

•£  *  1  ,  m*  *  m  ,  ■  l-m 


So  the  general  solution  will  be 


The  Second  Technique. 

Theorem  (4-3)  :  If  f^U),  f^(X)  are  polynomials 
of  degree  n  of  X  with  coefficient  in  the  field  of  complex 
numbers  such  that 


fa(R) 

fe(R) 


fv 

id 

1° 

i#] 

i] 

tjl 

Tj 

(4-32) 

(4-33) 


where 


Lo 


B 


,  if  V"1  exists,  then  a  solution  X  of 

:-i 


N-VX  *  0  is  A  solution  of  (4-1) .  Moreover,  if  M  exists, 

A  A 

then  a  solution  X  of  N  +  XM  »  0  is  also  a  solution  of 


(4-1)  . 

Proofs  The  matrices  fa(R)  and  R  commute  which  implies 


V 

* 

N 

0 

A 

m 

C 

A 

* 

C 

V 

N 

0 

o. 

.0 

B 

m 

as 

0 

m 

B 

— 

_0 

0_ 

VA 

VC+NB 

‘av 

AN 

0 

0 

¥ 

0 

- 

o 

0  _ 

This  implies  the  following  identities 


AV  -  VA  ,  AN  «  VC  +  NB 


(4-34) 
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if  X  is  a  solution  of  N-VX=0  ,  then,  using  (4-34) ,  the 
following  holds 


0  =  A (N  -  VX) 

*  VC  +  NB  -  AVX 
»  V(C  +  XB  -  AX) 

and  since  V-1  exists,  X  is  a  solution  of  (4-1) .  In 
the  same  way,  we  can  prove  the  second  part. 

Example  (4-2) 

Solve  the  same  problem  in  example  (4-1) 
f (X)  *  |R  -  XI l  »  (X2  -  X) (X2  +X) 

All  the  possible  polynomials  are: 


Case-1 

f  (X) 

-  X 

-  X 

Case-2 

f  (X) 

-  X2 

+  X 

Case-3 

f(X) 

-  X2 

Case-4 

f(X) 

-  X2 

-  1 

Case-1: 

i 

C’ c 

EE1 

■ 

a 

a 

m 

OKI 

D 

Q 

Li 

m 

K 

M 

m 
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’.c-i  v 


Consider  the  equation 


N  +  XM  *  0  ,  does  not  exist,  has  the  general 

solution 


X*  -  N  M  +  Y  (I  -  MM  ) 


J?t  m  arbitrary 


(4-35! 


Substituting  into  the  equation  (4-1)  by  this  solution, 
we  obtain  the  condition  on  1 ,m'  to  make  (4-35) a  solution 

Jl  ®  1  ,  m  arbitrary 

i.e.,  the  general  solution  is 


Case-2 


Consider  the  equation 


N  ■  VX  ,  V  is  singular,  has  the  general  solution 

X  »  V”  N  +  (I  -  V  v“)Y 

,  m'  arbitrary 

Again,  substituting  in  equation  (4-1)  we  have  the  same 
solution. 

Case-3 

f(A)  =  R2 

1  0  :  1  o" 

»  10  110 

o  6  •  o  r 

o  o  1  o  1 

The  theorem  cannot  be  applied. 

Case-4 

f (A)  -  R2  -  I 


The  theorem  cannot  be  applied. 
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So  the  previous  theorem  is  applicable  only  for  the  cases 
where 


f  (X)  is  either  f A ( X )  or  fB(X). 


The  Third  Technique 

In  this  part,  solutions  for  Lyapounov  equation  will 
be  obtained  in  terms  of  the  principal  idempotent  and 
nilpotent  matrices  associated  with  the  matrices  A,  B. 

To  solve  equation  (4-1) ,  the  following  more  general 
equation  will  be  considered 


AXE  +  DXB  *  C 


(4-36) 


If  A  is  an  n»n  matrix  having  elements  which  belong 
to  the  field  of  complex  numbers  and  {a^  is  the  set 
distinct  characteristic  roots  of  A,  then  A  has  the 
following  representation: 


( a jA j  +A j ) 


(4-37) 


where  the  matrices  {A^>  ,  {A^>  form  a  complete  set  of 

principal  idempotent  and  nilpotent  matrices  with  the 
following  property: 


All 

^i  Aj  “  3jAj  '  ^  Aj 


I  )  j  m  If  2  f  •  •  •  $ ro 


(4-38) 
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A. A.  =0  for  j  /  k  ,  A.  A.  =  A. A.  =  A  . 

J  K  J  J  J  J  J 


(4-39) 


Theorem  (4-4) 

Consider  the  following  matrix  equation 

AXE  +  DXB  =  C  (4-40) 


where  the  matrices  A,  E,  D ,  and  B  have  the  following 
representation  in  terms  of  a  complex  set  of  idempotent 
and  nilpotent. 
ja 


(Vj 


jb 


B 


3  +  BJ>  ’ 


3*1 


-Z  ,d^ 


3  +  DjJ  ! 


j*l 

je 

E  »  ^  (e^  +  Ej) 

j-1 


with 
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0 


A.C  -  CBk  - 


A  C  =  CEk  *  0 


DjC  *  CBk  =  0 
DX  =  CEk  ■  0 


for  all  possible  j,k  and  whenever 


a  .  e  .  +  d,b 
l  ]  k  e 


=  0  ,  then 


Ai  Dk  CEj  Be  3  ® 


The  equation 

(4-40) 

has  a 

solution 

X  given  by 

ii 

X 

jd 

ja 

y 

AiDkC  EjBe 

(4-41) 

e*l 

4 

j=l 

L 

i=l 

aiej+  dkbe 

Lemma  (4-1) 

The  solution  for  the  Lyapounov  equation  (4-1)  given 
that  A,B  are  given  as  in  (4-41)  is 


ib 

X  .  V 

<T*  AiC  B 

L 

2-  -  bs 

i=l 

Example  (4-1) :  Solve 

the  same  problem  given  in 

Example  (4^1  ). 
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Alc  b2  =  0 

A2C  Bx  ■  0 

and  the  solution  is 

X  =  A^C  Bl  +  A2CB2+1j  A3CB2+A3CB1 

6  17/2  1 

-4  -6  0 

2  3  0 

Check  that  X^B^)  -  X(£Bj)  *  (£Bj)X  »  0 
The  Fourth  Method 

This  method  is  sufficient  to  obtain  all  solutions 
for  the  Lyapounov  equation  (4-1) .  Equation  (4-1)  can  be 
written  in  the  vector  form  as 

F  x  -  c  (4-42) 

_  _  2  2  2 
where  x#c  are  n  *1  elements  and  F  will  be  n  •  n  .  This 

method  is  obviously  not  suitable  for  larqe  n. 

2*2 

For  A  and  B  and  C  e  C  and 


104 


Example  (4-5) :  Considering  the  same  problem  in  example 


Solving  the  system 

Fx  =  c 


we  obtain  the  general  solution 

4«1  1-1 

X  ■  (ST)  c  +  N  Z 


consisting  condition  ho<Ids 


106 
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V  Conclusion  and  Recommendations 


Conclusion 

An  algorithm  for  computation  of  various  kinds  of 
generalized  inverses  is  established  for  the  matrices  over 
the  field  of  complex  numbers.  The  existence  and  compu¬ 
tation  of  various  kinds  of  generalized  inverses  over  the 
ring  of  polynomials  in  several  variables  are  studied. 
Equivalence  of  a  matrix  to  its  Smith  form  over  the  rinq 
of  polynomials  in  several  variables  is  studied.  A  new 
algorithm  for  finding  the  solution  of  Ax  *  b  over  the 
field  of  polynomials  in  several  variables  is  established. 

Recommendations 

1.  Implementation  of  these  algorithms  on  computer. 

2.  Study  of  sufficient  and  necessary  conditions 
for  a  matrix  over  the  ring  of  polynomials  in  several 
variables  to  be  equivalent  to  its  Smith  form. 

3.  Explicit  solution  of  Lyaponov  and  Ricate  equa¬ 

tion  in  terms  of  generalized  inverses.  Extension  of 
Jones  work  £15}  ,  fltij  ,  • 

4.  Applications  in  the  field  of  control  theory. 
Extention  of  the  work  ofi 

a.  Frost  and  Storey  (Contrability  and  Obser¬ 
vability)  [l)J  ,  [til 

b.  Da*  and  Ghoshal  (Construction  of  Reduced- 
order  Observes)  £7}  . 
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c. 


'J 
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Appendix  A 

Basic  Applications  of  Generalized  Inverses 

Solution  of  linear  equation  Ax=y: 

Theorem  (A-l) : 

A  necessary  and  sufficient  condition  that  Ax=y 
is  consistent  is  that 


AA^y  =  y  (A— .1. ) 

The  general  solution  of  the  consistent  equation  is 

x  *  A2y  +  (I  -  AjA)Z  ,  (A-2) 

where  Z  is  an  arbitrary  vector. 

Proof  s 

Sufficiency:  if  (A-l)  is  true,  then  A^y  is  a 
solution. 

Necessity:  Suppose  that  Ax  ■  y  is  consistent, 

then  there  exists  w  such  that 

Aw  »  y 

AA^ (Aw)  ■  y 

aax  y  ■  y  . 

A-l 


To  complete  the  proof  it  is  sufficient  to  prove  that  (A-2) 
is  a  solution  for  Ax=*y.  Substituting  (A-2)  into  the 
equation  Ax«y  we  have 


Ax  -  A(Axy  +  (I-A^A) Z) 

■  AAjy  +  AZ-AA^AZ 

=  y  +  ZA  =  AZ 

*  y 

To  prove  that  any  solution  x  can  be  derived  from 
(A-2) ,  we  can  choose  Z  as  follows: 

Z  m  X  -  Gy 

x  «  Gy  +  (I-GA)Z  -  Gy  +  (I-GA)  (x-Gy) 

=»  Gy  +  X  -  Gy  -  GAx  +  GAGy  ■  X  -  GAX  ♦  GAGAX 

*  X  -  GAx  +  GAx  -  x 
Theorem  (A-2) : 

The  necessary  and  sufficient  condition  that  the 
equation  AXB«C  has  a  solution  is  that 

AA.CB.B  -  C  ,  (A-3) 


in  which  case  the  general  solution  is 


X  =  AjCBj,  +  Z  -  AjAZBBj^  (2-4)  (A-4) 

where  Z  is  an  arbitrary  matrix. 

Proof  s 

Sufficiency  is  trivial  since  A^CB^  is  a  solution. 
Necessity  proof:  if  the  equation  is  consistent,  then 
there  exists  X  such  that 

AXB  +  C 

AAj^AXBJBjB  »  C 
AA^CBj'B  *  C  . 

Substituting  X  given  by  (A-4)  in  AXB  ,  we  have 
A(AjCB^+Z-AjAZBB^)B  =  C+AZB-AZB-C.  Any  solution  of 
AXB«C  is  obtainable  through  (A-4)  by  a  suitable  choice 
of  Z.  For  example,  X  can  be  obtained  if  we  put 
Z  -  X-AjCBj 

Solution  »  A^CB|  +  (x-AjCB)  -  A jA (x-A^CB^ ) BB^ 

■  X  -  A^AxBB^  +  A^  A  A^  C  Bj  B  B^ 
■X-A^AxBB1+  A^C  B^ 


A- 3 


X 


=  x  -  C  +  A1CB1  = 


Theorem  (A-3) 

Let  A(m»n)  ,  c(m»P)  ,  B(p.g)  ,  D(n.g)  be  given 
matrices.  A  necessary  and  sufficient  condition  for  the 
consistent  equations  AX=C  ,  XB=D  to  have  common 
solution  is  that 

AD  =  CB  , 


in  which  case  the  general  expression  for  a  common  solution 
is 


X  =  A^C  +  DBl  -  AjADBj^  +  (I-AjAJZ  (I-BB^ 


where  Z  is  arbitrary. 
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